Neural Network Classifiers for Optical Chinese Character Recognition

نویسندگان

  • Richard Romero
  • Robert Berger
  • Robert Thibadeau
  • David Touretzky
چکیده

We describe a new, publicly accessible Chinese character recognition system based on a nearest neighbor classifier that utilizes a number of sophisticated techniques to improve its performance. To increase throughput, a 400dimensional feature space is compressed through multiple discriminant analysis techniques to 100 dimensions. Recognition accuracy is improved by scaling these dimensions to achieve uniform variance. Two neural network classifiers are compared using the new feature space, Kohonen’s Learning Vector Quantization and Geva and Sitte’s Decision Surface Mapping. Experiments with a 37,000 character ground truthed dataset show performance comparable to other systems in the literature. We are now employing noise and distortion models to quantify the robustness of the recognizer on realistic page images.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optical Chinese character recognition using probabilistic neural networks

Building on previous work in Chinese character recognition, we describe an advanced system of classification using probabilistic neural networks. Training of the classifier starts with the use of distortion modeled characters from four fonts. Statistical measures are taken on a set of features computed from the distorted character. Based on these measures, the space of feature vectors is transf...

متن کامل

Neural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten

Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications that includes, reading aid for blind, bank cheques and conversion of any hand written document into structural text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...

متن کامل

Handwritten Character Recognition for Tamil and English Using Wavelet Transform

Intensive research has been done on optical character recognition (OCR) and a more number of articles have been published on this topic during the last few decades. Many commercial OCR systems are available in the market. But most of the systems work for Roman, Chinese, Japanese and Arabic characters. There is no sufficient number of works on Indian language especially Tamil language. Recogniti...

متن کامل

Font Recognition of Chinese Character Based on Multi-Scale Wavelet

Optical character recognition system research has been acquired howling success, but the reconstruction of layout needs fonts of the characters. In this paper, a novel font recognition algorithm is proposed, which is based on multi-scale wavelet analysis. We adopt wavelet analysis and the grid method to deal with the character image, and extract wavelet energy density feature, and apply the BP ...

متن کامل

Multi-class SVM Classifier With Neural Network For Handwritten Character Recognition

The paper describes the process of character recognition using the Multi Class SVM classifier combined with a neural Network approach. The character recognition techniques or the OCRs are either a printed document recognition or the handwritten character recognition. SVM (Support Vector Machine) classifiers often have superior recognition rates in comparison to other classification methods. In ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995